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ON  THE  POYER  FUNCTION  OF  A  FIGN  TEST 


FORMED  BY  USING  SUBEAMPLE8 
By  John  E.  Y'elsh 
Douglas  Aircraft  Co.,  Inc. 
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1.  Summary.  The  sign  test  can  be  used  to  obtain  signifi¬ 
cance  tests  for  the  population  median  under  extremely  general 
conditions.^  One  disadvantage  of  the  sign  teat  for  the  median  is 

the  limited  nUpber  of  suitable  significance  levels  available  for 

\ 

a  given,  ^nuaber 'of  observations.  If  the  observations  ere  dravn  at 
several  subsets  of  specified  sizes,  however,  a  variation  of  tite 
sign  test  can  be  applied  using  order  statistics  of  order  ttntlstica 
of  these  subsets.  This  test  furnishes  a  much  wider  variety  of 
suitable  significance  levels  and  Is  valid  under  the  same  conditions 
as  the  sign  teat.  The  purpose  of  thlsHfie^  Is  to  investigate  the 
power  efficiency  of  the  significance  tests  for  the  median  formed 
in  this  way  for  the  particular  case  In  which  each  observation  Is 
drtwn  from  the  same  normal  population.  For  the  cases  considered. 

It  is  found  that  the  power  efficiency  Is  almost  always  decreased 
by  using  two  or  more  subsets  rather  than  all  the  obsirvetlons  as 
a  single  set;  sometimes  this  decrease  In  power  efficiency  Is  very 
large.  Also,  for  a  given  significance  level,  It  Is  found  that  the 
power  efficiency  can  vary  noticeably  with  the  manner  In  which  the 
test  Is  formed. 

\. 

2.  Statement  of  conditions  and  tests*  Consider  n  independfet’t 
observations  drawn  from  n  (possibly  different)  populations  which 
sa'^ifify  the  eondltlene 

\ 


--- 
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(i)  Been  population  has  a  unique  median 
(il)  Each  population  is  continuous  at  the  meclan 
(i.e.  its  cdf  is  continuous  at  the  median). 

(ill)  The  median  of  each  population  has  the  same  Talue  f 
The  following  significance  tests  compare  p  with  a  given  value  ^ 
through  use  of  these  n  observations. 

Let  x(l|n),...,  x(njn)  represent  the  valuegof  the  n  observa¬ 
tions  arranged  in  increasing  order  of  magnitude.  Then  the  one- 
sloed  sign  test  of/’c^ls  defined  by 

(1)  Accept  3t(i|n)c 

The  significance  level  of  this  test  is  equal  to 

(2)  PrfxdlnXf].  . 

Itl"  (n-f)I 

The  one-sided  sign  test  of  ia  defined  by 

(3)  Accept if  x(D4>l-i  In) . 

The  significance  level  ol  this  test  is  also  given  by  (2). 

An  equal  tell  sign  test  of  ^  ia  given  by 

(4)  if  either  xii\n)<p^  oy  x(ntl-l  |n)  ,  (i>n  1) 

The  significance  level  of  this  test  is  equal  to  twice  the  value 

of  (2). 

Now  suppose  that  the  n  independent  observations  were  drawn  as 
r  subsets,  the  kth  subset  being  of  size  n^,  (k»l,...,  rsn;n*n]^f . . .  tn^.) , 
Then,  for  given  computed  for  the  kth 


L 


subset  by  the  use  of  (2).  Let  y.  be  the  Jth  largest  of  x(<<  |n  *) 

•  X  X  f 

. . .  ,x(<<^|nr) .  Then  th?  ralues  of  Pr(y^<f)  and  Pr(y^cfcy^)  can  be 
determined  from  elcaentary  probability  considerations  by  using  the 
values  of  the  Pr  J  and  the  fact  that  the  subsets  are 

Independent.  By  appropriate  clioices  of  r,  the  the^^,  u»  aad  v, 
significance  tests  with  a  wide  variety  of  suitable  significance 
levels  can  frequently  be  found. 

The  subset  tests  considered  here  are  restricted  to  t'^e  following: 


(5)  Accept  if  max  ;krl. 

The  significance  level  of  this  test  equals 

The  one-sided  test  of  f  y  f^i§ 


(6) 


(7)  Accept  if  mln[x(n^fl-e^|nj^);k-l,.,.,^>^^  , 

The  significance  level  of  this  test  is  also  equal  to  (6). 

The  equal  tall  teat  ot  is  defined  by 


(8) 


Accept  if  either  Bax 
or  min  {~x(n.^f  1-^^  |  n^)  ik»l^ . « » ^  ^  ^  . 


The  significance  level  of  this  test  is  equal  to  twice  the  value 
of  (6). 

3.  Significance  levels.  Examination  of  the  results  of  section 
2  shows  that  the  significance  levels  of  all  of  the  tests  (1),  (3)f 
(4)#  (5),  (7),  (8)  are  determined  if  the  significance  levels  of 
tests  (1)  and  (5)  wre  known.  Thus  it  la  sufficient  to  restrict 
significance  level  considerations  to  tests  (l)  sod  (5)« 
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Frono  a  practical  Tlewpolnt,  the  l®portant  significance  levels 
for  one-sided  tests  are  In  the  .05  -  .005  range.  Table  1  contains 
a  li:t  of  the  tests  of  type  (1)  which  hare  significance  levels  near 
this  range  for  nil5.  It  Is  seen  that  suitable  significance  levels 
are  not  avallatle  for  nc4.  Also  there  Is  a  very  limited  choice  of 
satisfactory  levels  for  all  values  of  n  from  4  to  15  Inclusive. 

If  the  n  observations  are  drawn  as  subsets,  suitable  signifi¬ 
cance  levels  are  still  not  avalla  le  for  nC4.-  ?or  nt6,  however,  a 
greater  variety  of  satisfactory  levels  can  be  obtained.  In  practice 
the  .05,  .025,  .01,  .005  significance  levels  are  of  particular  Im¬ 
portance.  Table  2  shoi^s  how  closely  these  levels  can  be  approximated 
for  test  (5).  The  approximations  can  be  made  very  close  for  ii»12 
and  at  least  as  good  for  n>12. 

Thus,  if  the  observations  ere  not  drawn  as  subsets,  the  number 
of  suitable  significance  levels  Is  very  limited.  Drawing  tne  otserva 
tlcns  as  subsets  furnlsnes  many  more  satisfactory  significance  levels 
Examination  of  the  power  efficiencies  listed  In  Tables  1  ana  2  snows, 
however,  that  noticeable  efficiency  can  be  lost  by  using  the  subset 
Qrawlr.g  procedure  11  the  observations  are  a  aample  from  a  normal 
pop  jlaCl on. 

4-  Po^er  efficient.'  uerlvEtlons.  The  power  efficiency  of  a 

significance  tert  Is  ceflnod  In  [ll3-  Essentially  the  power  efflcl- 

test 

ency  of  a  slgnlf Icance/yequals  100  m/c  %,  where  n  Is  the  sample  size 
for  the  given  test  and  a  is  the  sample  size  (not  necessarily  Inte¬ 
gral)  of  the  corresponding  most  powerful  test  at  the  same  signifi¬ 
cance  level  whose  power  function  le  approximately  the  same  as  that 
of  the  given  test. 
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For  oneoflided  t«st8  •/■■•trlcsl  t«ttt  of 

f  i  ft  Bost  poverfuL  test  for  a  ssaple  froa  a  normal  population 
(orJfnown  variance)  is  the  appropriate  Student  t-test.  Also  it  it 
sufficient  to  limit  investigations  to  the  one-’tided  tests  (1)  and 
(5).  As  shown  in  Q3,  tests  (3)  and  (4)  have  the  same  power 
efficiency  as  (1)  while  tests  (7)  and  (8)  have  the  saDe  power 
efficiency  as  test  (5). 

The  power  efficiencies  listed  in  Table  1  were  obtained  from 
[^1,  Table  ^  and  will  not  be  derived  here. 

Let  the  aoriiial  population  have  variance (T^  and  consider  test  (5) 


Power  Function  = 


“kl _ 

(njj-s)ltl 


wr.ere 


i--  (/j-r)/». 


Tr.e  power  function  values  for  test  (5)  listed  in  Table  3  were 
coxpjted  frco  (9) .  The  correspondlr^  t-test  power  function  values 
were  obtained  by  using  the  normal  approxlma tl cn  given  In  .  The 
power  efficiencies  listed  in  Table  2  were  obtained  from  the  results  of 
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Table  3. 

Examination  of  Tables  1  and  2  shots  that  a  substantial  amount  of 
Information  can  be  lost  by  using  a  test  of  the  form  (5)  rather  t^'^n 
of  the  form  (l) .  The  asymptotic  results  of  ^3^  show  that  test  (l) 
is  always  at  least  63.7^  efficient.  Some  of  the  tests  of  Table  2, 
however,  are  only  slightly  more  tnan  efficient.  On  the  other 
hand,  some  tests  of  type  (5)  have  approximately  the  same  power 
efficiency  at  the  corresponding  types  (1)  tests.  For  example,  the 
test 

Accept  JU[  max  [x(8|  9)  ,x(2 133<  fo 

comperes  very  favorably  with  the  corresponding  type  (1)  tests. 

TJ'ils  shows  that  the  use  of  test  (5)  should  not  be  ccipletely  avoided 
but  thst  the  power  efficiency  of  a  test  of  this  type  should  be 
investigeted  before  that  test  is  considered  for  application. 

Tests  having  the  same  significance  level  and  based  on  the 
same  total  number  of  observations  can  have  a  wide  variety  of  power 
efilclencles  depe  ding  on  the  way  in  which  the  test  is  formec. 

Consider  the  case  of  significance  level  .0175  and  n  ■  15.  Table 
4  contains  tne  power  efficiencies  of  six  tests  formed  In  different 
ways.  Tnese  tests  vary  from  tc  lui  efficient.  Use  of  the  test 

Accept  f<,  if  max  []x(4|7),x(l|2)  ,x(l|2)  ,x(4|4y) 

results  in  the  loss  of  8.7  sample  values  while  only  3.85  sample 
values  ere  lost  by  using  the  test 


Accept  f<fo^  x(l2|l5)  <f^  . 
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It  Is  to  be  obserred  that  tii»  above  pover  efficiency 
Investigations  ere  based  on  the  assuaptlon  that  the  n  observations 
ere  a  sample  from  a  norm-1  population.  If  there  is  no  reason  to 
believe  tr^t  the  n  observations  ccie  from  the  same  normal  population, 
the  pover  efficiencies  listed  In  Tables  1  and  2  nay  '-e  far  from  the 
true  values.  The  te-^ts  formed  by  drawing  the  observations  as  sub¬ 
sets  might  be  very  efficient  for  certain  non-normal  situations. 
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TABI£  1 


Type  (1)  Tests  ulth  Sijyiificance  Levels  Near  the  ,05  -  ,005  R^nge 


Test 

Sicnlf- 

icanco 

Level 

Approx.  1 
Effi-  1 
cioncy 

x(4l4)< 

.0625 

95^ 

x(5l5)<  <p. 

.0312 

'  965«  . 

x(6j6)-«:. 

.0156 

95% 

x(7l7)4 

,0078 

95% 

x(a|8)<9^ 

,0039 

95% 

x(8l9)<.  91. 

.0195 

d2% 

x(9|10)-C9„ 

.0107 

aoi 

x(10lll)<9. 

.0059 

ei% 

x(10|l2)<.,;p^ 

.0193 

15% 

x(U|13)4^, 

.0112 

15$ 

x(12|l4)<  9), 

.0065 

i&%  n 

x(  13115)4  9). 

.0037 

78^  1 

Test 


x(7|8)<^. 


x(8|l0)49^ 


x(9|U)<9, 


Sienif- 

icance 

Level 


,0625 


.0352 


,0547 


.0327 


x(10|13)  C?.  .0^62 


x(U|lA)<.?.  .0287 


i(12|15)<.<>,  .0175 


Approx, 

Kfft- 

cioncy 


Typ«  (5)  Toata  Near  the  .05,  . 025,  .01,  .005  Si^ificance  Levels 


TAnir.  3  iCrnc'icilci) 


Icarice  Test 

S;iCi;  le 

Zize 

A}  {  rox. 
hffi- 
ciency 

SirT'if- 

icanc« 

Level 

Aj.proxi r.<'jte  Val  '^e 

Vi  cf  icn->: 

m 

S  =.6 

^  «1.2 

8 

.0049 

.089 

.451 

.  864 

.  990 

ji-az'  xv'l  1  3}  ,Jc(3 

12 

G6i 

.0049 

.115 

.487 

.82i. 

.9:ii 

t 

8.75 

.0098 

.184 

.',00 

.974 

' 

Xv8j9;,x(2 

Pj<?* 

12 

rat, 

.0098 

.200 

.699 

.959 

t 

5.5 

.0254 

.189 

.bib 

.696 

.^90 

1  1 

x(2'  i»;,x{2; 
1  ^  ' 

12 

u% 

.0254 

.220 

.588 

.859 

.965 

t 

6 

.0508 

.353 

.811 

.933 

1 

f - 

- ! 

1 

l_  * 

7J<^^ 

12 

bOi 

.0508 

.376 

.808 

.973 

1 

_  J 

•canct 


